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Nucleotide Sequences Codikg For Cryptosporidium Proteins, 
Polypeptides coded by said sequences and kits for the use thereof 

Description 

The invention concerns nucleic acids and 
5 polypeptides coded by said nucleic acids, derived from 
protozoan parasites of Cryptospoxidlxxm genus. Nucleic 
acids and peptides are advantageously used for 
developing detection assays of Crypzosporidium in 
biological samples of human and animal origin and/or in 

10 the environment. 

Cryptosporidium parasites infect the intestinal 
tract of several animal species • Over the last decade 
the number of infections in humans has dramatically 
increased. Most of the affected patients show a marked 

15 immxxnodef iciency, with a high incidence of AIDS. The 
immunocompromised patients develop a severe and 
frequently irreversible diarrhoea which causes 
malnutrition and represents a major factor leading to 
death, 

20 The prophylaxis of Cryptosporidium infections 

is hampered by the lack of reliable immunoassays for 
the detection of the parasite. Moreover, by microscopic 
examination, parasite oocyst are difficult to 
discriminate from several microoganisms that are 

25 morphologically similar to Crypzosporidium, like 
Candida species. 

The development of immunological and molecular 
diagnostic assays highly specific for Cryptosporidium 
" absolutely requires the biochemical characterisation 

30 and localization of parasite antigen?=; and the cloning 
of corresponding genes. The availability of diagnostic 
assays would pemit to develop prophylactic measures 
for immunodef icient patients by detecting the parasites 
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in the environment (food/ water) and in individuals 
(relatives, nurses) that may act as carriers • 

Moreover, no effective therapeutic compounds 
against Cryptosporidium infection are available; 
5 therefore there is the need to develop reagents able to 
prevent the onset of the infection in immunodef icient 
patients . 

The authors of the present invention have 
obtained a rabbit antiserum raised against a whole 

10 lysate of Cryptosporidium oocysts. The serum has been 
used to screen a genomic library of Cryptosporidium 
from infected intestinal mucosal cells, into the 
expression vector Xgtll* Among clones isolated, clone 
cpRL3 has been shown to comprise a 2359 bp (SEQ ID 

15 No.l) insert with an open reading frame coding for a 
polypeptide of 786 amino acids (SEQ ID No»2) . Scanning 
of the 67.0 version of the "GENE BANK"" data base using 
the cpRLB sequence failed to reveal any similarity with 
known DNA sequences. 

20 The cpRL3 insert has been subcloned into an 

expression plasmid and the corresponding recombinant 
polypeptide is produced in E. coli^ fused at the 
N-terminus to a stretch of six histidines. The 
histidine sequence allows a fast and efficient 

25 purification by nickel chelate chromatography of the 
polypeptide encoded by cpRL3 . The recombinant 
polypeptide is used as immunogen in Balb/c mice for 
producing antisera and monoclonal antibodies. The 
recombinant protein is highly immunogenic. This 

30 sequence codes for a portion of a protein of the oocyst 
wall of Cryptosporidixom that has an apparent molecular 
weight of 190,000 Dalton, named "Cryptosporidium Oocyst 
Wall Protein" (COWP) . 

Nucleotide and polypeptide sequences derived 

35 from the isolated sequence are advantageously utilized 
for diagnosis of Cryptosporidium infections in patients 
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and/or animals and for analysis of environmental 
contamination by Cryptosporidium oocysts. Treatment of 
' Crypt osporidx am infection may be obtained by 
administration of antibodies raised against COWP, 
5 The invention concerns also: - designing of 

specific oligonucleotides, as primers in polymerase 
chain reactions (PGR) reactions for the detection of 
the COWP DNA sequence; - the utilisation of COWP amino 
acid sequence or fragments thereof to raise antisera 

10 and/or monoclonal antibodies for the development of 
immunological assays to detect the presence of 
Cryptosporidium and/or COWP molecule; - the use of 
COWP derived polypeptides, either synthetic or 
recombinant, as components of diagnostic kits for the 

15 detection of COWP released by parasites; - the 
utilisation of COWP-derived polypeptides, either 
synthetic or recombinant, for producing antisera and/or 
monoclonal antibodies to be employed in the therapy of 
Cryp tosp ori diosis. 

20 It is a specific object of the present 

invention a polypeptide in a substantially purified 
form comprising a contiguous sequence coded by a 
Cryptosporidium gene, said gene comprising a nucleotide 
sequence at least SO % homologous to the sequence of 

25 SEQ ID No.l. Preferably said contiguous sequence 
comprises an antigenic determinant of Cryptosporidium, 
More preferably said contiguous sequence is coded by 
the sequence of SEQ ID No.l, or parts thereof; most 
preferably said contiguous sequence is comprised in the 

30 aminoacid sequence of SEQ ID No. 2. 

It is another object of the invention a 
diagnostic kit for the detection of Cryptosporidium in 
biological and environmental samples comprising, as 
specif ligand/ the polypeptide according to the 

35 invention. 
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It is another object of the invention the use 
of a polypeptide according to the invention for raising 
'antibodies able to detect Cryptosporidium infection in 
biological and environmental samples, 
5 It is another object of the invention an 

antibody obtained using as immunogen a polypeptide 
according to the invention. 

It is another object of the invention a 
diagnostic kit for the detection of Cryptosporidium in 

10 biological and environmental samples comprising, as 
specif ligand/ an antibody able to react with at least 
one polypeptide according to the invention. 

It is another object of the invention an 
oligonucleotide derived from a Cryptosporidium gene, 

15 said gene comprising a sequence at least 50 % 
homologous to the sequence of SEQ ID No.l, Preferably 
said oligonucleotide has a sequence comprised in the 
sequence of SEQ ID No,l/ or in the complementary 
sequence of SEQ ID No.l, 

20 It is another object of the invention a 

diagnostic kit for the detection of Cryptosporidium in 
biological and environmental samples comprising, as 
specif ligand, the oligonucleotide according to the 
invention. 

25 It is another object of the invention a PGR kit 

for the amplification of Cryptosporidium DNA 
comprising, as specif primer, at least one 
oligonucleotide according to the invention. Preferably 
said PGR kit comprises two oligonucleotides having 

30 nucleotide sequences according to the invention. 

The invention will be illustrated in the 
following examples, by making reference to the 
following figures, wherein: ^ 
- Figure 1 shows an immunoblot analysis of the 

35 mouse serum (MlO/01) (A) and of a control mouse serum 
anti TRAP (B) against: the expression product of the 
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control pXasmid pDS56/RBSII~E-6his-TRAP purified on 
nickel column (1) ; the expression product of the 
'plasmid pDS56/RBSII-E^6his-cpRL3 purified on nickel 
coluinn (2) ; a protein lysate of E. coll cells 
5 transformed with pDS56/RBSII-E^6hiS"cpRL3 induced with 
IPTG (3); and non induced (4); molecular weight 
standard are indicated. 

- Figure 2 shows an immunoblot analysis using 
the mouse serum MlO/01 (A) and the MTUdI IB2 (B) against 

10 a protein lysate of C* parwm oocysts. 

- Figure 3A shows an electrophoresis of PGR 
products amplified from C. parvum DNA (1; 4) and from 
DNA of the plasmid pDS56/RBSII-E-6his-cpRL3 (2; 5) using 
primers combinations Cry-3/Cry-6 (1;2) or Cry-S/Cry-S 

15 (3;4;5;). As control, PGR reaction without templare DNA 
(3) • Figure 3B shows an electrophoresis of PGR products 
amplified from DNA of several parasite species using 
primer combination Cry-S/Cry-G : DNA extracted from: C, 
parwm (1) , Sarcocystis sp. (2), Giardia lamblxa (3) 

20 and P. falciparum (4) . Figure 3G shows an 
electrophoresis of PGR products amplified from DNA of 
progressively diluted C. parvum oocysts using the 
primer combinations Cry-S/Cry-o: molecular standards 
(1), 160 oocysts (2), 80 oocysts (3), 40 oocysts (4), 

25 20 oocysts (5) 10 oocysts (6) , As control, PGR reaction 
is done without template DNA (7) or in the presence of 
P. falciparum DNA (8) . In panels A, B and C, lanes 6, 5 
and 1 respectively, DNA markers are 3611; 1166; 606; 
517; 396; 318; 263 bp. 

30 Example 1 X gtll library with DNA extracted from C> 
parvum infected calf intestinal mucosa 

To develop a parvum genomic expression 

library, DNA extracted from the intestinal mucosa of an 
infected calf is used* 

35 A newborne calf is infected with 6 x 10^ 

oocysts of C, parvum MI ISS-1 (Pozio et al. 1992. 
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Trans. R. Soc. Trop, Med* Hyg. 86:636-638), After 5 
dayS/ the gut is opened, cut into segments of 30 cm 
'each and washed in phosphate-buffered saline (PBS), 
Nitro-cellulose filters (soaked in PBS) of the same 
5 size as the gut segments are applied to the mucosal 
side for a few seconds. Filters are progressively 
numbered and processed for DNA extraction. A small 
sample is removed from each filter and analysed by 
microscopy to determine whether parasites have been 

10 removed. The sample from each filter is incubated with 
1% glutaraldehyde in cacodylate buffer for 2 h. The 
filters are dehydrated through an increasing ethanol 
series^ embedded in Epon, and cured at 60**C for 24 h. 
Sections are cut at 0.2-/iim thickness and stained with 

15 toluidine blue. The analysis reveals that nitro- 
cellulose filters remove only the superficial layer of 
mucosal cells, together with a large number of 
parasites • 

Only DNA extracted from filters that have 

20 removed a large nxamber of parasites are used. DNA is 
digested with EcoRI and cloned in Xgtll EcoRI digested, 
3' end to the coding sequence of j3~galactosidase gene. 
Phage DNA with cloned inserts is packaged in vitro 
{Boehringer Mannheim in vitro packaging kit) to 

25 generate the library. The quality of the library is 
evaluated by analysing the sizes of a subset of inserts 
by polymerase chain reaction (PCR) with 
oligonucleotides corresponding to the flanking 
sequences of the EcoRI site of the )3-galactosidase 

30 gene. The library has a complexity of 4.5 X 10^ plaques 
and an estimated average insert size of 1,800 bp. 

The expression library is analysed by use of 
a rabbit serum developed against purified oocysts of C. 
parvnm HI ISS-1. The seriam is used after removal of the 

35 background reactivity by several absorptions on filters 
soaked with bacterial and phage lysates. Specific 
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antibodies bound to filters are detected by use of a 
second anti-rabbit antibody conjugated to alkaline 
'phosphatase. An insert, named cpRL3, consisting of a 
2,359-bp open reading frame encoding a polypeptide of 
5 786 amino acids ; is isolated. The lack of both a start 
codon and a stop codon indicates that the sequence 
represents part of the coding sequence of the isolated 
parasitic gene* 

Example 2 Expression of the cpRL3 sequence in coll 
10 The DNA insert cpRL3 is cloned in the EcoRI 

site of plasmid pDS56/RBSII-E* 6xHis (a pDS56/RBSII 
derived plasmid containing an EcoRI site in the 
polylinker) . The expression unit of this vector is 
under the control of an isopropyl-/3- 
15 thiogalactopyranoside ( IPTG) -inducible promoter and 
yields a fusion between a stretch of 6 histidines and 
the amino terminus of the inserted seqaences (Stxiber et 
al. 1990, Eur. J- Immunol 20:819-824), cpRLS is 
expressed in E. coli Ml 5 carrying the lac 
20 represser-producing plasmid pUHAl . Induction is 
performed in LB medium for 4 h at ST'^C; 1 mM IPTG is 
added when the cell density reaches an optical density 
at 600 nm of 0 6. 

Example 3 Purification of recombinant polypeptide 6x 

25 His-cpRL3 

The expression product of the cpRL3 sequence 
(recombinant polypeptide 6xHis-cpRL3) is purified in a 
single-step procedure, by nickel chelate affinity 
chromatography (Stuber et al., ibid.). In brief, one 

30 litre of an induced culture of M:i5(pUHAl) cells 
carrying plasmid pDS56/ElBSII-E-6xHis-cpRL3 is harvested 
and stirred for 3 h in 100 ml of 6 M guanidine 
hydrochloride, 100 mM Na2HP04/ pH 8. The suspension is 
centrifuged at 10,000 x g, and the supernatant is 

35 directly applied to a nickel column (NTA-resin, 
Diagen) . After an equilibration step with 8 K urea, 100 
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mM NaH2P04/ 10 mM Tris, pH 8, 6xHis"cpRL3 is eluted by 
lowering the pH of the urea solution stepwise to pH 4 . 
'From one litre of culture, 1 mg of 6xHis--cpRL3 is 
obtained* 
5 Example 4 Immunoblotting 

Parasite lysates are obtained from 
Cryptosporidium oocysts purified by Percoll gradient 
centrifugation (Waldman, E., et al. 1986. J, Clin. 
Microbiol. 23:199-200) . Parasites are lysed by 

10 incubation of pellet of 2 x 10*7 oocysts with 0.2 ml of 
sample buffer (33 mM Tris-HCl, pH 6.8, 190 mM glycerol, 
0.1% SDS) . Bacterial lysates are obtained by treatment 
of 10^ induced or non induced E, coli cells with 1 ml 
of sample buffer. Proteins in total cell lysates are 

15 separated by SDS-polyacrylamide gel electrophoresis 
(PAGE) (Laemmli.U.K. 1970. Nature (London) 227:680-685) 
and electroblotted onto nitro-cellulose filters 
(blotting buffer, 25 mM Tris, 192 mM glycine, 20% 
methanol) . Non-specific adsorption of antibodies to the 

20 nitro-cellulose is prevented by saturation of the 
filters with 1% bovine serum albumin in 2x TEST (20 mM 
Tris-HCl, pH 8, 300 inM NaCl, 0.1% Tween 20) for 2 h at 
room temperature. Nitrocellulose filters are incubated 
with antibodies for 2 h at room temperature. After 

25 extensive washing with 2x TBST, antibodies bound to the 
filters are detected by use of goat anti-mouse 
immunoglobulin (heavy and light chains) conjugated to 
alkaline phosphates (Promega) . Phosphatase activity is 
disclosed by incubation of the filters with 0.3 mg of 

30 Nitro Blue Tetrazolium and 0.15 mg of 5-bromo 
4-chloro-3-indolyl phosphate per ml in 100 mM Tris-HCl 
(pH 9.5) -100 mM NaCl-5 inM MgCl2. 
Example 5 Monoclonal antibody production 

After purification by nickel chelate 

35 chromatography, recombinant polypeptide 6xHis-cpRL3 is 
used as immunogen to develop specific antisera and 
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monoclonal antibodies. BALB/c mice are immunised three 
times with 50 /xg of purified 6xHis-cpRL3 polypeptide in 
'complete (for the first immunisation) or incomplete 
Freund's adjuvant. Five days after the last 
5 immxinisation^ mouse spleen cells are fused with x63 Ag 
8653 myeloma cells and subsequently screened for 
antibody production (Kohler,G, and C. Milstein 1975 
Nature (London) 256:495-497). 

The supernatants of cultures from growing 

10 hybrids are tested in an enzyme-linked immunosorbent 
assay (ELISA) against 6xHisCpRL3* Immunised mice 
develop an antibody titer of 1/500,000. Figure 1 {A and 
B) shows the ability of one of the antisera to 
recognise specifically by immunoblot the recombinant 

15 polypeptide encoded by cpRL3 . The antiserum recognises 
the recombinant polypeptide in a total lysate of 
bacteria in which the expression of the cpRL3 insert is 
induced with IPTG (Fig. lA, lane 3) . The antiserum also 
recognrize the polypeptide after purification by the 

20 nickel chelate chromatography. The specificity of the 
reaction is demonstrated by the lack of reactivity 
against E. coli proteins / as well as against an 
unrelated recombinant protein, TRAP, expressed from the 
control plasmid pDSS6/RBSII-6xHis (TRAP) , also sharing 

25 the six histidine amino- terminal tail* Antibodies 
developed against the expression product of insert 
cpRL3 (mouse antiserum and monoclonal antibody 2B11) 
detect, in the lysate of Cryptosporidium oocysts, a 
protein of the apparent molecular weight of 190,000 

30 Dalton, (Figure 2 A and B) . These results indicates 
that the expression product of the insert cpRL3 is part 
of a Clyptosporidium protein expressed in the parasite 
oocysts . 

Example 6 Immunofluorescence microscopy 
35 Purified parasite oocysts are air dried on a 

coverslip and fixed in cold acetone for 5 min. Non- 



wo 94/04681 



PCr/IT93/00089 



10 



specific .binding is prevented by pre-incubation of the 
samples in PBS containing 1% bovine albumin. Primary 
antibodies (culture supernatant) are allowed to react 
for 40 min at room temperature, and the secondary 
5 f luorescinated antibody (Becton Dickinson goat 
anti-mouse) is allowed to react for 20 min. Observation 
of the samples is carried out with confocal microscopic 
apparatus (Bio-Rad Laboratories) * 

The protein is localised by immuLnofluorescence 

10 using the monoclonal antibody 2B11 on a preparation of 
oocysts fixed with acetone. The monoclonal 2B11 
specifically binds to a protein of oocyst wall of 
Cryptosporidium. The result of the iromiinclocalization 
is confirmed by confocal immunofluorescence analysis 

15 which shows that the reactivity is detected on the 
surface of the oocyst wall. On the basis of these 
observations, the Cryptosporidium protein encoded by 
the insert cpRL3 is named as Cryptosporidium oocyst 
wall protein, COWP. 

20 By immunofluorescence, monoclonal antibodies 

can identify as few as 100 oocysts/ml of stools. The 
detection limit of this method is determined 
resuspending known numbers of oocysts in samples of non 
infected stools. 

25 Example 7 ELISA assays 

Monoclonal antibodies produced against the 
product of the insert cpRL3 are able to recognise COWP 
in solution when used in ELISA assays based on antigen 
capture with two antibodies. Therefore specific 

30 antibodies directed against this protein or fragments 
thereof having an antigenic activity, as the product 
of cpRL3, can be employed to develop diagnostic assay 
to reveal Cryptosporidium infection. 
Example 8 PCR 

35 For all PCR experiments, the cpRL3 sequence is 

amplified in a standard 50 /xl PCR reaction mixture 
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(Saiki, R K., et al. 1988. Science 239:487-491; Scharf, 
S, J. et al. 1986. Scieace 233:1076-1078) for 35 cycles 
*at 94,5'*C (1 min) , SB'^C (30 sec); 72*^0(1 min) , with a 
Lab Line thermal cycler. The final concentration of 
5 MgCl2 is 2 mM. Primers used are: 
Cry 3 { 5 ' GTCCTACTGGATTCACTCTAC- 3 ' ) 
coding strand, nt* 722-742 of SEQ ID No . 1; 
Cry 5 ( 5 ' -CCAGGACATCATCATGGTCATTCTCATGGGC-3 ' ) 
coding strand, nt. 1099-1129 of SEQ ID No, 1; 

10 Cry 6 (5'-CCGaATATGTAACACATTTATCCGC-3M 

non coding strand, nt. 1828-1852 of complementary 
strand of SEQ ID No* 1. 

For amplification of the cpRL3 sequence from 
Cryptosporidium oocysts, samples are incubated for 5 

15 min under reducing conditions and boiled for 10 min 
thereafter. TaqI polymerase is purchased from 
Perkin-Elmer Co* 

The ability of two oligonucleotide corabinations 
to amplify by PGR the sequence of cpRL3 is shown in 

20 Figure 3A. Both of oligonucleotide combinations 
Cry3/Cry6 and Cry5/Cry6 are able to amplify DNA 
segments of the expected molecular weight from the 
plasmid pDS56/RBSII-E-6xHis (cpRL3) and from DNA of 
CryptospoTldium. The amplification reaction using 

25 oligonucleotides Cry3/Cry6 is highly specific, (figure 
3B) . Oligonucleotides in fact do not amplify any DNA 
segment when DNA of other protozoa (Giardia lamblia^ 
Plasmodium falciparum o Sarcocystls suls homlnis) is 
used as template. In PCR experiments the 

30 oligonucleotide combination Cry3/Cry6 is able to 
amplify a specific DNA segment from as few as 40 
oocysts of Cryptosporidium, (Figure 3C) . These data 
indicate that by using combination of oligonucleotides 
corresponding different regions of cpRL3 sequence, PCR 

35 can be employed to detect the presence of 
Cryp tosporldlum. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Universita' degli Studi di Roma "La 
Sapienza" 

(B) STREET: P.le Aldo Moro 5 

(C) CITY: Rome 

(E) COUNTRY: Italy 

(F) POSTAL CODE (ZIP) : 00185 

(A) NAME: Istituto Superiore di Sanita' 

(B) STREET: V.le Regina Elena 299 

(C) CITY: Rome 

(E) COUNTRY: Italy 

(F) POSTAL CODE (ZIP) : 00161 

(ii) TITLE OF INVENTION: Nucleotide sequences coding for 
Cryptosporidium proteins, polypeptides coded by said 
sequences and kits 

(iii) NUMBER OF SEQUENCES: 2 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1-0/ Version #1.25 

(EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE SS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2359 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

^ TTC ^ TGC CC^ CC^ G=T AC^ ATT TT. ^ GAT =AT CAA TGT CAA « 

Glu Phe Glu Cys Pro Pro Gly Thr lie hen i^ys ^t^n ^ 
I 5 1° 

TCG ATA GAA AGA GTT GAT ACA ATT TGT CCA CCA GGG TTT GTA GAT AAT 96 
Ser lie Glu Arg Val Asp Thr He Cys Pro Pro Gly Phe Val Asp Asn 

20 2^ 

rrc GAA GAT TGT GTC CAA TTT TCT GCA CCA GAG AAA ATT TGC CCC CAA 144 
Gl5 G^ Sp cys vTl Gin Phe Ser Ala Pro Glu Lys lie Cys Pro Gin 

35 

r-ra TTT TCT CIT TCC GGR MA CM. TGT GTI AAA ACA GAA TCT GCT CCA 192 

^ III III III ser Gly Lys Gin Cys Val Lys Thr Glu Ser Ala Pro 

50 

ArA TTA ACA GAA TGC CCA CCA GGT ACA ACC TTG GAA AAT AAC AGT TGT 240 

^ llu tS CVS pro pro Gly Thr Thr Leu Glu Asn Asn Ser Cys 



^TT TCA TAT GAA CTA GAA GAT GCC ATT TGT CCA CCT GGA TAT CTC GAC 288 
He ser Tyr Glu Leu Glu Asp Ala He Cys Pro Pro ciy xyr 

85 

nr-^ TrT. rar •vcr GTT CAG TTT TCT CAA CCA GAA AAG GAG TGT CCA 336 
AAT GGA TCA GAC TGT GTT CAG ill p^^ 

Asn Gly Ser Asp Cys Val Gin pne ber ^±u c no 

100 

nrj. rrr TTT GTA TTA ATT GGA AAA CAA TGT ACC CAA ACT ACT CAA GCT - 384 
ACA GGT TTT GTA ii^ ^ ^ Ala 

Thr Gly Phe Val Leu He Gly Lys Gin i^ys lux. o 
115 120 

rCA CCA CAA CCA GAG TGT CCT CCA GGT ACA AAC CTG GTA AAT GGA CAA 4 32 
Pro Pro G^^ Pro Glu Cys Pro Pro Gly Thr Asn Leu Val Asn Gly Gin 
130 

, r^mm n^-^ nrr aTA AAT ATG GTA TGT CCA ACT GGT TTT ATT 480 
TGC CAA AAA GTT GAA AGG ATA AAT ATG GT 

CVS Gin Lys Val Glu Arg iie Asn r^ei- °- y 

150 -"-^^ 



145 



r-^r. ^T.r CCT ACA AAT TGT GCT TCT TTC TCC GCA CCA AAC AGA GAA TGC 528 
GAT AAT GGT ACA AAi ^^.u Cys 

Asp Asn Gly Thr Asn Cys Ala ser fne a«i 

165 

„..m ,v/-7v r-v-v TCT GGA TCC CAA TGC GAG CAA ATA AAA GAA 57 6 
CCA CCT GGA TAT ACA CTT TCT GGA ie.C 

pro Pro Gly Tyr Thr Leu Ser Gly Ser Gin Cys Glu Gin lie y 

180 
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rcT PTT TCA GAA TGT CCA CCA GGA TAT AAA CTT CAA GGA AAT 624 
GCA CCT CCT GTT TCA GAft. l^x q-^^ G3_y p^sn 

Ala pro Pro Val Ser Glu Cys Pro Pro Gly Tyr Lys 

195 200 

T^rvn rzvT rrT ATC TGC CCA GAT GGA TTT 672 

™' ^ ^ Met ill Sp Si lie cys Pro Gly Phe 

Gin cys Thr Ala Leu Lys wei^ xxt; ^^t^ 



210 



« ^mr. r^T^Ti TTT TCT CCT GCT TCA ACT GTA 720 
TTA CCA AAT GGA GAC GAT TGT ATC CAA TTT TCT CCT GC 
Leu pro Asn Gly Asp Asp Cys He Gin Phe ser fro 
225 230 

..X CCT ACT CC TTC ACT CT. CA. W C^ C.C TGT CTT ^ .CT 768 

Cys pro Thr Gly Phe Thr Leu Gin Asn Gxn v^x y 

2 4 5 



r^TVTi -rr-r rrx CCA GGT TCT GCG TTG GAT GGA 816 
ACC TCA CCA AAA ACA CCA GAA TGT CCT CCA GGl 

Thr ser Pro Lys Thr Pro Glu Cys Pro Pro Gxy 

260 



^ TCC TCC .C. ^ CTT CTT CCC CCC CCT CTT C^ T.C CTT TCT CCT S.. 
Asp ser Cys Thr Arg Leu Val Pro cxy 

275 

.XT GGT ACT AGA GAG GGG GAC GTT TGC GTA GAG AGA TCG ATT AGT TCG 912 

val Gly Thr Arg Glu Gly Asp Val Cys Val Glu Arg 

290 29b 

rrr TAT TCA TTG GAA ACA GGT AAA CAA 960 
CCT GTT TTG GAA TGC CCA CCT GGT TAT TCA TiG 

Pro Val Leu Glu Cys Pro Pro Gly Tyr Ser Leu ^^o 



305 



..T CTT ^. ^ .CC T.T ^ TCT TO. CT. .C. .CT T.T CTT .C. XCOS 

cys Val Arg Arg Ser Gin Tyr Asp ^ys 

OAC TGT AAA ACA CCT GAT GTT AAA GCA CTA AGA AGA TTA ^A GCT GCA 1056 
Glu cys Lys Thr Pro Asp Val Lys Ala Leu Arg Arg 

340 



^rj. TTT TAT GAA ACA TCT GAG ATA CAA AAT CCA GGA 1104 

e ^ ^ ^'^^ %i "° ' 

"^^aH -jw^ 



355 360 



^aT TPA CAT TCA CAA GTT ATA CCA 1152 
CAT CAT CAT GGT CAT TCT CAT GGG wAT TCA CAT TCA 
His His His Gly His Ser Hxs Gly nx^ ser nxs 
370 375 

™ ^^-Tv r-aa r-a.T TAT AAA GAG GCT CCA AGG 1200 
ATT CAA ACC CAG AAT ATA CAT ACA CAA CAT CAT ^ GAG 
He Gin Thr Gin Asn He Hxs Thr Gin Hxs hxs uy 



385 390 
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CCA ATT TGT GAA GAT GTT CCA AAA ATT ACC CCA AAA ACT TGT ACA AAA 1248 

Pro He Cys Glu Asp Val Pro Lys He Thr Pro Lys Thr Cys Thr Lys 

405 410 415 

GCT GAT TCT GTC CCA GCT GTG CCT ATT TGC GAG AAC AAT GCT GAA CTT 1296 

Ala Asp Ser Val Pro Ala Val Pro He Cys Glu Asn Asn Ala Glu Leu 

420 425 430 

GTA GGA AAA GAA TGT GTA TTA ACA AAT TAC TAG CCA TTA GTIA GCA ATT 1344 

Val Gly Lys Glu Cys Val Leu Thr Asn Tyr Tyr Pro Leu Glu Ala He 

435 440 445 

TGT CAA GAT GGA ACA AGA TCA AAA GAG TGT GCT AAG TTT GTA AAA ACT 1392 

Cys Gin Asp Gly Thr Arg Ser Lys Glu Cys Ala Lys Phe Val Lys Thr 

450 455 460 

CCA CCT ACT TTA AAA TGT CCG CCA GGT TCT GTA GAT GTA GGA TCT CAA 1440 

Pro Pro Thr Leu Lys Cys Pro Pro Gly Ser Val Asp Val Gly Ser Gin 

465 470 475 480 

TGT CAA GTT AAC AAA TAT TCA CCA TAT GAT CTT GCA TGC CCT GCA GGA 1488 

Cys Gin Val Asn Lys Tyr Ser Pro Tyr Asp Leu Ala Cys Pro Ala Gly 

485 490 495 

TAT GCA TTG GTT GGA GAC AAA TGC GCT ACC ACA AGA GAA AAA GTT TGC 1536 

Tyr Ala Leu Val Gly Asp Lys Cys Ala Thr Thr Arg Glu Lys Val Cys 

500 505 510 

CCG AAT GZUV AGT TGC CAA AGA GTT GTA ACT GCG CCT GTT TCT TTA ACT 1584 

Pro Asn Glu Ser Cys Gin Arg Val Val Thr Ala Pro Val Ser Leu Thr 

515 520 525 

TGT CCC CCT GGA TAT CAC CAA ATA GAT GAA GTT ATG AAT ATT TCT GCT 1632 

Cys Pro Pro Gly Tyr His Gin He Asp Glu Val Met Asn He Ser Ala 

530 535 540 

CAT CCA CAC CAC AGA CAC TTA GCT GGG GTT CAA TCT ACT TCT CAA AAG 168 0 

His Pro His His Arg His Leu Ala Gly Val Gin Ser Thr Ser Gin Lys 

545 550 555 560 

GGA TAT TCT CAT GGA CAT AAA TAT ACT CCT GTA ATT TCT CAG CCA CCA 1728 

Gly Tyr Ser His Gly His Lys Tyr Thr Pro Val He Ser Gin Pro Pro 

565 570 575 

CAA CCA GTT CCA GTT GTT GCT CCT ATT CAG CAA ATG AAA TGC ATC CAT 177 6 

Gin Pro Val Pro Val Val Ala Pro He Gin Gin Met Lys Cys He His 

580 585 590 

GCA AAC CAT GCT CCA TAT AAT CTT ATC TGT CCT GTT GGA TCA AGA CTT 1824 

Ala Asn His Ala Pro Tyr Asn Leu He Cys Pro Val Gly Ser Arg Leu 

595 600 605 
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GTA GCG GAT AAA TGT GTT ACA TAT TCG GAT AAA ATA TGT CCA AAT GGT 1872 

Val Ala Asp Lys Cys Val Thr Tyr Ser Asp Lys lie Cys Pro Asn Gly 

610 615 620 

AAT TGC GAG CGT ATA TAT AAT GAG CCT GCT GAA. TTA GTA TGC CCT CCA 192 0 

Asn Cys Glu Arg lie Tyr Asn Glu Pro Ala Glu Leu Val Cys Pro Pro 
€25 630 635 640 

GGA TTC TCA TCA TCT AAA CCA ATT CAG CCA ATA AGC CAT TCT CAT ATT 1968 

Gly Phe Ser Ser Ser Lys Pro lie Gin Pro lie Ser His Ser His lie 

645 650 655 

AAC CAT CCA AAT GTT TCT GTT CCC GTC CAA CCA CAA ACT ATT AAC CAA 2016 

Asn His Pro Asn Val Ser Val Pro Val Gin Pro Gin Thr lie Asn Gin 

660 665 670 

CCA CAA GTA ATT CAA CAA AGA CAA GTA AAT TAT CAG CCA CAA GTA ATT 2064 

Pro Gin Val lie Gin Gin Arg Gin Val Asn Tyr Gin Pro Gin Val He 
675 680 685 

CAT CAA ACA CAG GAA ATT TTA ACA ACT TAT CCA ACT CCA GTT TAC CAA 2112 

His Gin Thr Gin Glu He Leu Thr Thr Tyr Pro Thr Pro Val Tyr Gin 

690 695 700 

ACC GGC ACA ATT TAT CAA GGA CAT CAT CAT CAT CAT CAT CAT CAT CAC 2160 

Thr Gly Thr He Tyr Gin Gly His His His His His His His His His 
705 710 715 720 

AGA AAT CTA GCT TCC CCT GAG TGC ATT A?VG ACA ATT TCA GTA CCT TAT 2208 

Arg Asn Leu Ala Ser Pro Glu Cys lie Lys Thr He Ser Val Pro Tyr 

725 730 735 

ATT TTA AJVA TGC GAA TCT CCA TTT ATT TTA GAT GGC GAC APiA TGT ATC 2256 

He Leu Lys Cys Glu Ser Pro Phe He Leu Asp Gly Asp Lys Cys He 

740 745 750 

GAA AAA ACA GAA AAA ATT TGT CTA CAA GGT GAC TGC AGA AAA CAA GTC 2304 

Glu Lys Thr Glu Lys He Cys Leu Gin Gly Asp Cys Arg Lys Gin Val 
755 760 765 

GTC GTT CCA CCA ACT CTT TCA TGT CCA CAA GGT TAC AGA AAT GCC AAC 2352 

Val Val Pro Pro Thr Leu Ser Cys Pro Gin Gly Tyr Arg Asn Ala Asn 

770 775 780 



GGA ATT C 
Gly He 
785 



2359 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 78 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Glu Phe Glu Cys Pro Pro Gly Thr lie Leu Lys Asp Asp Gin Cys Gin 
15 10 15 

Ser lie Glu Arg Val Asp Thr lie Cys Pro Pro Gly Phe Val Asp Asn 

20 25 30 

Gly Glu Asp Cys Val Gin Phe Ser Ala Pro Glu Lys lie Cys Pro Gin 

35 40 45 

Gly Phe Ser Leu Ser Gly Lys Gin Cys Val Lys Thr Glu Ser Ala Pro 
50 55 60 

Arg Leu Thr Glu Cys Pro Pro Gly Thr Thr Leu Glu Asn Asn Ser Cys 
65 70 75 80 

lie Ser Tyr Glu Leu Glu Asp Ala lie Cys Pro Pro Gly Tyr Leu Asp 

85 90 95 

Asn Gly Ser Asp Cys Val Gin Phe Ser Gin Pro Glu Lys Glu Cys Pro 

100 105 110 

Thr Gly Phe Val Leu He Gly Lys Gin Cys Thr Gin Thr Thr Gin Ala 
115 120 125 

Pro Pro Gin Pro Glu Cys Pro Pro Gly Thr Asn Leu Val Asn Gly Gin 
130 135 140 

Cys Gin Lys Val Glu Arg He Asn Met Val Cys Pro Thr Gly Phe He 
145 150 155 160 

Asp Asn Gly Thr Asn Cys Ala Ser Phe Ser Ala Pro Asn Arg Glu Cys 

165 170 175 

Pro Pro Gly Tyr Thr Leu Ser Gly Ser Gin Cys Glu Gin He Lys Glu 

180 185 190 

Ala Pro Pro Val Ser Glu Cys Pro Pro Gly Tyr Lys Leu Gin Gly Asn 
195 200 205 
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Gin Cys Thr Ala Leu Lys Met lie Asp Ala lie Cys Pro Asp Gly Phe 
210 215 220 

Leu Pro Asn Gly Asp Asp Cys lie Gin Phe Ser Pro Ala Ser Thr Val 
225 230 235 240 

Cys Pro Thr Gly Phe Thr Leu Gin Asn Gin Gin Cys Val Gin Thr Thr 

245 250 255 

Thr Ser Pro Lys Thr Pro Glu Cys Pro Pro Gly Ser Ala Leu Asp Gly 

260 265 270 

Asp Ser Cys Thr Arg Leu Val Pro Gly Ala Leu Gin Tyr Val Cys Pro 
275 280 285 

Val Gly Thr Arg Glu Gly Asp Val Cys Val Glu Arg Ser lie Ser Ser 
290 295 300 

Pro Val Leu Glu Cys Pro Pro Gly Tyr Ser Leu Glu Thr Gly Lys Gin 
305 310 315 320 

Cys Val Arg Arg Ser Gin Tyr Asp Cys Ser Val Thr Thr Tyr Val Thr 

325 330 335 

Glu Cys Lys Thr Pro Asp Val Lys Ala Leu Arg Arg Leu Ala Ala Ala 

340 345 350 

Lys Glu Thr Ser Thr Val Tyr Glu Thr Ser Glu lie Gin Asn Pro Gly 
355 360 365 

His His His Gly His Ser His Gly His Ser His Ser Gin Val lie Pro 
370 375 380 

lie Gin Thr Gin Asn He His Thr Gin His His Lys Glu Ala Pro Arg 
385 390 395 400 

Pro He Cys Glu Asp Val Pro Lys He Thr Pro Lys Thr Cys Thr Lys 

405 410 415 

Ala Asp Ser Val Pro Ala Val Pro He Cys Glu Asn Asn Ala Glu Leu 

420 425 430 

Val Gly Lys Glu Cys Val Leu Thr Asn Tyr Tyr Pro Leu Glu Ala He 
435 440 445 

Cys Gin Asp Gly Thr Arg Ser Lys Glu Cys Ala Lys Phe Val Lys Thr 
450 455 460 

Pro Pro Thr Leu Lys Cys Pro Pro Gly Ser Val Asp Val Gly Ser Gin 
465 470 475 480 



wo 94/04681 



PCr/IT93/00089 



19 



Cys Gin Val Asn 



Tyr Ala Leu Val 

500 

Pro Asn Glu Ser 
515 



Cys Pro Pro Gly 
530 



His Pro His His 
545 



Gly Tyr Ser His 



Gin Pro Val Pro 

580 

Ala Asn His Ala 
595 

Val Ala Asp Lys 
610 

Asn Cys Glu Arg 
625 



Gly Phe Ser Ser 



Asn His Pro Asn 

660 

Pro Gin Val lie 
675 

His Gin Thr Gin 
690 

Thr Gly Thr lie 
705 



Arg Asn Leu Ala 



lie Leu Lys Cys 

740 



Lys Tyr Ser Pro 
485 

Gly Asp Lys Cys 



Cys Gin Arg Val 

520 

Tyr His Gin lie 
^3 3 ^5 

Arg His Leu Ala 
550 

Gly His Lys Tyr 
565 

Val Val Ala Pro 



Pro Tyr Asn Leu 

600 

Cys Val Thr Tyr 
615 

lie Tyr Asn Glu 
630 

Ser Lys Pro lie 
645 

Val Ser Val Pro 



Gin Gin Arg Gin 

680 

Glu lie Leu Thr 
695 



Tyr Gin Gly His 
710 



Ser Pro Glu Cys 
725 

Glu Ser Pro Phe 



Tyr Asp Leu Ala 
490 

Ala Thr Thr Arg 
505 

Val Thr Ala Pro 



Asp Glu Val Met 

540 

Gly Val Gin Ser 
555 

Thr Pro Val He 

570 

lie Gin Gin Met 
585 

He Cys Pro Val 



Ser Asp Lys He 

620 

Pro Ala Glu Leu 
635 

Gin Pro He Ser 
650 

Val Gin Pro Gin 
665 

Val Asn Tyr Gin 



Thr Tyr Pro Thr 

700 

His His His His 
715 



He Lys Thr He 
730 

He Leu Asp Gly 
745 



Cys Pro Ala Gly 
495 

Glu Lys Val Cys 
510 



Val Ser Leu Thr 
525 



Asn He Ser Ala 



Thr Ser Gin Lys 

560 

Ser Gin Pro Pro 
575 

Lys Cys He His 
590 



Giy Ser Arg Leu 
605 



Cys Pro Asn Gly 



Val Cys Pro Pro 

640 

His Ser His He 
655 

Thr He Asn Gin 
670 

Pro Gin Val He 
685 



Pro Val Tyr Gin 



His His His His 

720 



Ser Val Pro Tyr 
735 

Asp Lys Cys He 
750 
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Glu Lys Thr Glu Lys lie Cys Leu Gin Gly Asp Cys Arg Lys Gin Val 
755 760 765 

* 

Val Val Pto Pro Thr Leu Ser Cys Pro Gin Gly Tyr Arg Asn AlaAsn 
770 775 780 

Gly He 
785 
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Claims 

'1. A polypeptide in a substantially purified form 
comprising a contiguous sequence coded by a 
Cryptosporidium gene, said gene comprising a nucleotide 
5 sequence at least 50 % homologous to the sequence of 
SEQ ID No.l. 

2* A polypeptide according to Claim 1 wherein said 
contiguous sequence comprises an antigenic determinant 
of Cryptosporidium. 

10 3. A polypeptide according to any of previous claims 
wherein said contiguous sequence is coded by the 
sequence of SEQ ID No.l/ or parts thereof ♦ 
4* A polypeptide according to Claim 3 wherein said 
contiguous sequence is comprised in the aminoacid 

15 sequence of SEQ ID No. 2* 

5. A diagnostic kit for the detection of 
Cryptosporidium in biological and environmental samples 
comprising/ as specif ligand, the polypeptide according 
to the invention. 

20 6, Use of a polypeptide according to the invention 
for raising antibodies able to detect Cryptosporidium 
infection in biological and environmental samples . 
7. An antibody obtained using as immunogen a 
polypeptide according to any of Claims from 1 to 4 . 

25 8* A diagnostic kit for the detection of 
Cryptosporidium in biological and environmental samples 
comprising/ as specif ligand, an antibody able to react 
with at least one polypeptide according to any of 
Claims from 1 to 4 . 

30 9. An oligonucleotide derived from a Cryptosporidium 
gene, said gene comprising a sequence at least 50 % 
homologous to the sequence of SEQ ID No.l. 
10. An oligonucleotide according to Claim 9 having a 
secgfuence comprised in the seqruence of SEQ ID No»lf or 

35 in the complementary strand of SEQ ID No. 1. 
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11. A diagnostic kit for the detection of 

Cryptosporidium in biological and environmental samples 

comprising, as specif ligand, the oligonucleotide 
according to Claim 9 or 10. 

5 12. A PGR kit for the amplification of Cryptosporidium 
DNA comprising, as specif primer, at least one 
oligonucleotide according to Claim 9 or 10. 
13. A PCR kit according to Claim 12 comprising, as 
specif primer, two oligonucleotides according to Claim 
10 9 or 10, 
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